OPTIMAL ADAPTIVE MULTIDIMENSIONAL-TIME SIGNAL ENERGY 



ESTIMATION ON THE BACKGROUND NOISE. 

Eugene Ostrovsky 
Department of Mathematic, Soniclynx company, 
78042, Rosh Haain, Hamelecha street, 22, Israel; 
e-mail: eugeny@soniclynx.net 

Eugene Rogover 

Department of Mathematics and Statistics, Bar - flan University, 59200, Ramat Gan, 

Israel. 

e - mail: rogovee@gmail.com 
Leonid Sirota 

Department of Mathematic, Bar - Ilan University, Ramat Gan, 52900, Ramat Gan, 

Israel, 

e-mail: sirota@zahav.net.il 
Abstract. 



We construct an adaptive asymptotically optimal in the classical norm of the space 
L(2, Vt) of square integrable random variables the Energy estimation of a signal (function) 
observed in some points (plan of experiment) on the background noise. 

Key words and phrases: Energy functional, adaptive estimations, loss function, Fourier 
series, gradient, variation. 

1. Statement of problem. Notations. Let V(n),n = 16, 17, ... be a sequence of 
a vector - valued (in general case) sets (plans of experiences) in the cube [—ir,ir] d , d = 
1,2,3,...: 

V(n) = {xi = Xi = Xi(n), }, G [-7T, n] d . 

At the points we observe the unknown signal (process, field) / = f(x), x G [— 7r, 7i] d 
on the background noise: 

y(i) = m) + <rti, (1) 

where the sequence noise {&}, is the sequence of errors of measurements, is the sequence 
of independent (or weakly dependent) centered: E£j = normalized: Var(^) = 1 random 
variables, a = const > is a standard deviation of errors. 

Let us denote by the E m the so - called Energy operator, i.e. such that the Energy 
of the signal / may be defined as follows: (at last in the case m = 0, 1, 2 ) 

W m (f) = f \\E m f\\ 2 L 2 (-n,n) d , (2) 
where we define for an arbitrary M = 1,2,... — dimensional vector z = 
(zx, Z2,..., z M ) \z\ = \jYdLx z i and for tne vector function g = g(x) = (gx, g 2 , ■ ■ ■ , 9m) 




Note that the functional 



Il/I|5(m) 



1/2 



11=0 

is the classical Hilbert's - Sobolev's norm. 

We assume that the operator E m = E m (f) is the spherical invariant homogeneous 
differential operator of order m,m — 0,1,2, For example: 

E (f) = /; E ± (f) = grad(/), E 2 (f) = A(f) = £ (4) 

i=i ox i 

Note that if m = 2r is an even number, then E m = A r . 

Note that our assumption (4) is true when the environment is homogeneous and 
isotropic. 

Our goal is to offer and investigate an adaptive asymptotically as n — > oo 
optimal in the probabilistic L 2 {Vt) sense estimation (measurement) W m (f, n) = 
W m (n) of a energy functional W m (f) = W m , based on the observations {y(i), i = 
1,2,. ..,n}. 

The probabilistic L 2 (Q) sense denotes that we consider the following loss function: 

Z(W m {f),W m {f,n)) = -E\W m {f,n) - W m {f)f . (5) 

Here W m (f, n) = W m (n) denotes the estimation of the functional W m = W m (f), adaptive 
or not. 

The considered problem appears in the financial mathematic [4], technical diagnosis 
and geophysics [3], image processing [5] etc. 

The one - dimensional case is consider in [1] - [3] etc. We notice that there are 
some essential differences between the one - dimensional and multidimensional cases; we 
will show, for example, that in the multidimensional case we need to use only the optimal 
experience design. 

On the other words, this problem is called "filtration of a signal on the background 
phone", "adaptive noise canceler" or "regression problem". 

In the one - dimensional case d — 1 this problem was considered in many publications 
([1] - [5] etc). The case d = 2 is known as "picture processing" or equally "image 
processing" . 

2. Denotations. Assumptions. Construction of our estimations. ~Let z = z = 
{zj},j — 1, 2, . . . , d, Zj G [— 7r, 7r] be a d — dimensional vector, 

F(z) = (2n)- d n(vr + Zj ), S(n) = 6(n, V(n)) = 
j'=i 

n 

sup \G n (z) - F(z)\, G n (z) = n-^Hxi < 

Z£{-TT,TT d ] i = l 

where 

I(x < z) = 1 44> Vj = 1, 2, . . . , d xj < zj, 
and I(x < z) = in other case. 



The value, or more exactly, the function 5 = 
of a sequence of the plans V(n). 
We suppose that 



6(n) = 5(n,V(n)) is called discrepancy 



5(n) = 5(n, V(n)) < C(l, d) [\og(n)} d /n, (6) 

Note that in the one - dimensional case the condition (6) is satisfied even without 
the member \og d (n) if Xi = —n + 2iri/n (the uniform plan); but in general case d > 2 we 
need to use, e.g., the classical Van der Corput or the Niederreiters sequences (experience 
design) (see [8], p. 183 - 202), for which the condition (6) is satisfied. 

Note in addition that the Niederreiters sequences allow us to elaborate the convenient 
for application sequential estimation of the energy of signal W m (f). 

Recall that for arbitrary sequences of plans {V(n)} 

5( n ) = 5(n,V(n)) > C(2, d)[\og(n)] d ~ 1 /n,n> 3. 

We define also v = 2 l > d , N dym (n) = 0.25[n 2 ^ 4m+d ^>} and for N e (1, N d>m (n)) the 
"rectangles" 

R(N) = {k : maxk(j) < N}, R 2 (N) = R([u N}) \ R(N), 

j 

R 3 (N) = {k : mmk(j) > N + 1} = Z d + \ R(N). 

Here [z] denotes the integer part of a (positive) variable z. 

Let us use the classical orthonormal trigonometric system of a functions defined on 
the circle [— 7r, it] : 

ipi(x) = (27r)~ 1/2 , ip 2 (x) = 7r _1/2 sinx, tp 3 (x) = ir~ 1/2 cosx, ip^x) = ir~ 1/2 sin(2x), 

^(x) = tx~ x I 2 cos(2a;), ipe(x) = tt^ 1 ^ 2 sin(3x), ij>?(x) = cos(3a;) 

etc. We define for the multivariate integer index k = k = (ki, k 2 , ■ ■ ■ , kd), where Vs = 
1,2,... k s > 1, 

d 

(f>(k,x) = Yl^k a (x 8 ). (7) 

s=l 

Remark 1. Note that the functions {4>%{x)} are eigen functions for energy operator E m . 
This circumstance allow us a possibility to generalize our method. 

Further, we introduce the sequence of real numbers {X(k)} as a square norms of the 
functions {ipk(')} '■ = \ \dvp k (x) / dx\\ 2 . It is easy to compute: A(l) = 0; 



A(2) = A(3) = 1; A(4) = A(5) = 4; A(6) = A(7) = 9; A(8) = A(9) = 16, . . . . 
Let us define 



A d (k) = A d (k) d = f 



d ^ 2 



i=i 



Note that 



j'=i 



The estimating function E m f(-) belongs to the space L 2 (— Tr,n) d or equally W m (f) < 
oo if and only if 

W m (f) = T, c2 (k)-[A d (k)] 2m <oo. (8) 

k 

Remark 2. The expression (8) has a sense still in the case when the number m is 
non - integer. Further we will use only the expression (8) for the value W m (f) tacking 
into account all the values m, m e [0, oo). 

We denote in the considered case W m (f) < oo 

p m (N)=p m (f,N)= £ c 2 (k)-[A d (k)} 2m . 

keR 3 (N) 

It is evident that p m (N) | as N | oo. 

The values p m (N) = p m (f, N) are known and well studied in the approximation theory, 
see, e.g., ([9]). Namely, the variable p l J 2 (N) is the error in the L 2 [— ir, n] d norm of the best 
approximation of a function / by means of d — dimensional trigonometric polynomials 
with the maximal degree less or equal than iV : 

Po(N)= mf||/ - T(iV)|| 2 , 

where T(N) is the d — dimensional trigonometric polynomials with the maximal degree 
less or equal than N; and are closely connected with a so - called Zygmund's L 2 modules 
of continuity: 

u 2 (f,5) = sup\\f(x + h) - 2f(x) + f(x - h)\\, 5e [0,1], 

\h\<5 

where the algebraic operations x ± h are understood coordinatewise modulo 2tt. 
Indeed, the following implication holds: 

p l /2 (f, N) x N~ l ~ u , I = 0, 1, 2 . . . , v G [0, 1) « 
uj 2 (f«\5)^8». 

We impose the following condition on the signal /(•) : for some j3 > 0, 

p m (f\N)<CN~ 2 ?, (9) 

or more general condition: 

hm^oo — — — < 1. 
p{N) 

This conditions are satisfied in many practical cases. 



Further, we will consider only the infinite - dimensional case, i.e. when ViV > =>- 
p m (N) > 0. Moreover, we suppose 

3he (0,1),V7V>3 =s> Pm (f,N)>C h N . (10) 
As usually, we will denote A(N) x B(N), N = 1, 2, ... iff 

< Ci = inf A(N)/B(N) < sup A(N)/B(N) = C 2 < oo. 

N>1 N >! 

2. Construction of an estimations. 

We estimate the unknown coemcients c(k) = c(k) as follows: 

n 

c(k,n) =n~ l J2v( i ) (f>(k,Xi). (11) 
i=i 

The consistent estimation a 2 (n) of the variance a 2 is offered in [1]: 

o\n)=n^±(y{t) - f^\x{{))\ (12) 

i=l 

where fn\ x ) 1S an y preliminary consistent estimation of the function /(•). 

Let us offer a following family of a "projection" estimations W m (f, n, N) of the values 
W m (f) of a view: 

W m (f,n,N)= £ [A d (k)] 2m -(c 2 (n,k) - n-V 2 (n)). (13) 

keR(N) 

PFe obtain after some calculations under conditions (9) and the following condition 

(W- 

\\f\\S(max(d,2m)) < oo : (14) 

Z(W m (f , n, N), W m (f )) x n- 1 + p 2 n (N) + n - 2 N 4m+d (15) 

3. Non - adaptive estimation. 

We suppose that the conditions (9) and (15) are satisfied. Tacking the value N as 
follows: 



N 

i.e. 



N = argmin (n~ w + n ~ l N 2m+dl2 ) , 



we obtain: 



n l/(2/3+2m+fl!/2) 



= N (J3), 



Z(W m (f,n,N Q ),W m (f)) < K(f3,m,d) max 
Note that in the case 



n -l -40/(20+2m+d/2) 



(16) 



(3 > m + d/A 



(17) 



Z(W m (f,n,N ), W m ) < K 1 (^m,d) n" 1 . (18) 

The inequality (18) show us that the estimation W m (f, n, Nq) is asymptotically opti- 
mal. But it is non - adaptive, since it dependent on the parameter (3, which is unknown 
as usually in many practical cases. 

In the next section we offer the so - called adaptive estimation, i.e. which dependent 
only on the observations {y(i)} and has again the optimal speed of convergence, but under 
more strictly conditions. 

4. Adaptive estimation. 



Let us introduce the following important functional: 

r(N) = r(N,n) = £ c 2 (k,n) [A d (k)} 2m , (19) 

keR 2 (N) 

which dependent only on the source data {y(i)}, and let us choose the following (random) 
value of harmonics: 

N = argmin r(N,n). (20) 

N<N d>ra (n) 

It may be proved that under condition (9) as in articles [3] , [4] in the sense of conver- 
gence with probability one and in the L 2 (fl) norm 

lim N/n l/ W +2m+d) = K 3 (0, m, d), (21) 

n— too 

where K 3 = K 3 ((3,m,d) is some positive finite non - random constant. 

Correspondingly in the choice of iVo, we can offer the following still adaptive estimation 
of W m (f) ■■ 

W m (f,n)=W m (f,n,N). (22) 

Substituting into the expression for the loss function Z(-,-) and using the equality 
(21), we conclude that under conditions (9), (14) 

Z(W m (f , n), W m ) x max (n" 1 , n- 4 ^^ +2m+d )) . (23) 
Notice that in the case when 

p > m + d/2 (24) 

(c.f. with the condition (17)), the adaptive estimation (22) W m (f, n) is asymptotically 
optimal: 



Z(W m (f,n),W m ) < K 4 (/3,m,d) n" 1 . 



(25) 



5. Concluding remark. 

We conclude as in the case density energy estimations [11], [12] still under condition 

Wmax(d,2m) (/) < oo : 

A. In the case when j3 < m + d/4 the optimal estimation of W m (f) is impossible. 

B. In the case 

m + d/A < (} < m + d/2 
there exists only the non - adaptive estimation with optimal rate of convergence 

C. If ultimately 

(5 >m + d/2, 

then there exists the optimal adaptive estimation for the energy value W m . 

The optimality denotes, by our definition, that the speed of of convergence is (asymp- 
totically) 1/y/n. 

D. Note in addition that all the offered in this report estimations W m (f,n,N) are 
asymptotically normal distributed with parameters correspondingly 

EW m (f,n,N) - W m ~ Pm (N), 

Var(W m (f,n,N)) x n" 1 1 1/| |5(max(rf, 2m)) + p 2 m (N) + N 4m+d /n 2 . 

The last circumstance may be used by building of confidence interval for W m (f) and 
for verifications of different statistical hypothesis. 

F. Evidently, we can use by the computing of the offered here estimations the famous 
Fast Fourier Transform (FFT) with the computation of complexity x C(d) n logn. 

E. The accuracy proof of our assertions and building of the confidence region for W m (f) 
is at the same as in the articles [3] - [5]. It used the approximation theory, theory of 
martingales, for instance, the exponential bounds in the Law of Iterated Logarithm (LIL), 
theory of Banach spaces of random variables etc. 
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